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ABSTRACT 

A current project is underway to develop a multimedia 
system that would support the teaching and learning of Chinese 
characters, as well as provide a platform for conducting research 
into the cognitive aspects of Chinese language acquisition. Although 
the number of commonly used Chinese characters amounts to thousands, 
there are many structural commonalities and regularities among the 
characters. One of the objectives in developing the system is to 
build a knowledge-base representing the structural features of and 
relationships among Chine&e characters, where these structures can be 
made more explicit and explorable to the learner. The computer, with 
its interactive graphic ability, can provide a medium where character 
structures can be presented with flexible highlights directing 
learners' attention to various components; it can also provide a 
variety of activities where the Itarner can practice disassembling 
and assembling logographic components of Chinese characters. It is 
envisaged that through interactions with such environments, the 
learner would be able to develop a structural understanding about 
Chinese characters and acquire effective strategies for learning 
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Abstract: This paper reports on a current project to develop a multi-media 
system that would support the teaching and learning of Chinese characters, as well as 
provide a platform for conducting research into the cognitive aspects of Chinese 
language acquisition. Although the number of Chinese characters commonly in use 
amounts to thousands, there are in fact many structural commonalities and 
regularities among the characters, which relate these characters into a highly 
connected system. One of the objectives in developing the present system is to build 
a knowledge-base to represent these structural features of and relationships among 
Chinese characters, so as to support the development of interactive learning 
environments where these underlying structures of Chinese characters can be made 
more explicit and explorable to the learner. It is envisaged that through interactions 
with such environments, the learner would be able to develop a structural 
understanding about Chinese characters aad acquire effective strategies for learning 
them. 



This project intends to help young first language learners, from pre-primary to early primary levels, to learn 
Chinese characters. In learning a character, the learner has to learn its written form, its pronunciation and its 
meaning. There are reasons to believe that before the school begins to teach these children the written form of 
Chinese, the children have already acquired, from their daily life, a rich mental lexicon for the language, in its 
audio form (Kwong 1992). So the major tasks to be achieved in teaching these characters are (1) to provide 
effective means for the learner to encode the written form of the characters, and (2) to help the learner to relate 
these written forms to sounds and meanings already existing in their mental lexicon (Huang & Liu 1978). In the 
following sections, the authors will examine how a computer system may provide support for this teaching and 
learning process. 

Effective Learning of the Written Form of Chinese Characters 
Ideographic Nature of Chinese Characters 

Chinese characters are not just arbitrary aggregates of strokes. Many Chinese characters are pictographs 
standing for objects, or pictographs with certain markings added to indicate more abstract concepts 
(Shuowenjiezi, 100,Weiger 191 1; Hung & Tseng 1981). 

Research shows that at an early stage in the learning of Chinese characters, the learner tends to remember 
characters as distinct pictures (Chuang 1975). In one earlier project on software for learning Chinese characters 
carried out by the team (Lam 1993), computer animation was used to relate the written form of some Chinese 
(J characters to their pictorial origins. Preliminary evaluation indicated that such presentation is motivating as well 

as effective m helping young learners to remember the written form as well as the meanings of the characters 
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Figure 1. Chinese characters as pictorial and indicative symbols 
Chinese Characters as Structured Compositions 

Most Chinese characters are composite logographic forms (Shuwenjiezi, 100). In these "compound 
characters" two or more major components are combined to fill an imaginary square block. The methods of 
composition are highly regular. The components are reusable in producing different characters, and many of 
them are themselves characters. Some basic methods of composition are shown in figure 2, each illustrated with 
a compound character of that format. 



Figure 2. Chinese characters as structured compositions 

In general, the components contribute to the sound or the meaning of the compound character they form 
(Ann r982). In some cases, the components are themselves characters, and the meanings of these simple 
characters add together to form the meaning of the compound character. 

B + n = m 

sun tnoQn 

tr«« tr*« fonrt 

Figure 3. Characters representing an aggregate of meanings contributed by its components 

In other cases, one component represents the meaning while the other component the sound of the 
compound character. This is the so called pictophonetic composition. Over 70% of Chinese characters are 
compositions of this sort. 

Th« chtraclen h»v« rimflir 

•fR ^Pi nn jounds. The component B 

dejT fine invite contriintei to their sound, 
weethcr 

The other components 
contribute to their meanings. 



water sun «»y 



Figure 4. Examples of characters formed by pictophonetic composition 

If one further examines these components, one will notice that they can in fact be further decomposed into 
simpler reusable stroke patterns. These we shall refer to as "graphemes". Although this decomposition to the 
graphemic level often does not add knowledge about the meaning or sound of the character, familiarity with the 
Tet of commonly used graphemes do help people in their recognition and writing of the characters or components 
(Huang 1992). 
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Structured Encoding of Chinese Characters by Learners 



In studies conducted by Shen Yeh and Tsao Chuan Yung (1963, 1965). it was found that there were two 
major developmental changes in the process of mastering Chinese characters by primary school children: the first 
one was found immediately after the fu-st month of learning Chinese characters and the other was between the 
first and second grades. With the second transition, the ability to master unfamiliar characters grew rapidly. 

According to reported experience of some local school teachers, learners at grade two begin to demonstrate 
some mental processes of graphemic analysis and synthesis. Having leamt the characters ^ (brother) and O 
(son), a learner may identify the reusable grapheme When presented with a new character ^ (insert), the 
learner may be able to describe verbally the component i as a =p (thousand) on top of "a ^ without the ' ' ' ". 
This ability to generate new logographic constructs to chunk new information would help to reduce significantly 
the memory load required in encoding the form of new characters (Hung & Wang 1992). Furthermore, some 
learners can guess that the character means some action, because they know other characters like ^ (hit), ^ 
(push), (pull) also have the same component j and they all mean some actions. 



Using Computers to Facilitate Structural Perception and Understanding 

of Chinese Characters 

Although research seems to show structural encoding to be an important process in the learning of Chinese 
characters, current practice in primary grades is to drill learners in the stroke-by-stroke writing of characters 
(CDC 1990). Much of the learner's attention is thus directed to the individual strokes, while the development of 
structural perception of Chinese characters is very much left to his/her own initiative. 

In this project, we see great potentials for the computer to provide useful learning experience, beyond the 
conventional handwriting exercises, for the learner tc develop a structural perception of Chinese characters The 
computer, with its interactive graphic capability, can provide a medium where character structures can be 
presented with flexible highlights directing learners' attention to the various components. TTie computer can also 
provide a rich variety of activities where the learner can practise disassembling and assembling logographic 
components of Chinese characters. 

More importantly, the computer can allow the learner to leam the written form of Chinese characters with 
more meanings and interconnections. As pointed out earlier, there are many regularities in the construction of 
Chinese characters and the rich information encoded in the written form of a Chinese character may include cues 
about Its meaning and sound. Hypertext systems can be designed such that characters related tlirough particular 
structural features or components can be made readily accessible for the learner to examine. Characters can be 
learned in rich relationship with one another and learners can traverse from one character to the next through a 
shared component. Learners may even explore, like linguists, the regularities in the stnicture of Chinese 
characters. Knowing about the components of a character becomes an exciting pathway for information access 
and concept formation. In exploring this hypertext, learners can construct their own theories about the stnicture 
of Chmese characters. Such an approach to the learning of Chinese characters will be referred to as the 
structural constructivist approach. 

Although Chinese characters are ideographs and meanings are attached to individual characters such 
meanmgs are often not too specific. Most nouns, verbs and adjectives in Chinese are in fact made up of two or 
more characters. Such linguistic units will be referred to here by their original name in Chinese: ci m In many 
mstances, a seem to correspond more to words in English, while Chinese characters to monosyllabic 
morphemes (Hoosain 1992, Huang & Liu 1978). Ci ususally have meanings much more specific than characters 
For example, the idea of a policeman, corresponding to the ci is concrete and commonplace in everyday 
life, while the character ^ carries a more general or abstract notion of to notice or to examine A possible way 
for young learners to leam the meaning of a character would be to explore the meaning of a number of 
commonly used ci containing the character and make their own induction. Here the hypertext capability of 
computer can again play an important role. 
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A Knowledge-based Multimedia System to Support the Learning of Chinese Characters 



This section describes the design of the system we are currently developing. It is intended to be a flexible 
and extensible platform for the development of computer-aided-learning (CAL) software for Chinese character 
learning, as well as research into cognitive aspects of Chinese language learning. The system is designed to 
support, in particular, teaching and learning in the structural constnictivist approach. 

Conceptually, the system consists of five levels. The bottom layer of the system is a knowledge-base about 
structures, relationships, and meanings of Chinese characters. The second level is a database which records the 
frequency, extent of use and competence of a student in relation to the different elements in the knowledge-base. 
The third level is a set of application software providing access to the Chinese character knowledge-base (first 
level) and the student progress database (second level) for use by teachers and students. The fourth level is a set 
of courseware generation tools for use by teachers to generate customized CAL software. The fifth level is a set 
of CAL materials developed by tools in the lower levels for teaching various topics about Chinese characters. 

The System Backbone: The Chinese Character Knowledge-base and Its Basic Fuiicttons 

The Chinese character knowledge-base is built to represent the structural features and relationships among 
Chinese characters. It consists of two modules, one responsible for the logographic information of characters, and 
the other responsible for the meanings and sounds of characters. 

The logographic information motfu/e contains a database of logograms, including r\aracters, components, 
sraphemes and strokes. It records how each logogram is stepwise-decomposable into other simpler logograms, 
and the composition method. In the case of pictophonetic composition, it also makes note of the specific 
subcomponents contributing respectively to the sound and the meaning of the character. Pictorial origins of 
characters are stored as graphic animation where appropriate. 

The information stored will support the following operations: 

(a) to give a stroke-by-stroke calligraphic display of the character. 

(b) to provide a stroke ordering exercise where the learner can choose the correct stroke sequence in a 
character with continuous feedback from the computer. 

(c) to display the character, highlighting in turn the different components and subcomponents. 

(d) to support jigsaw-like exercises where the learner is asked to assemble characters from their components. 

(e) partial matching to search for a character based on partial information about its structures and components 



(f) to present the pictorial origins of some characters. The presentation starts with a realistic picture, and 
gradually transforms to a skeleton, and then to the written form of the character. The method is similar to 
that used in Lam (1993) 

In the sound and meaning module, each character node is linked to a set ofc/ illustrating the meaning of the 
character. A voice file and a phoneme are stored for the pronunciation of each character in the knowledge-base. 
For the special cases where a character is pronounced differently when it is used in different senses, a number of 
voice files and phonemes will be stored. For some of thee/, additional resources are stored including images and 
voice files of sample sentences. Further, for each character and ci, there is information about its meaning 
category and difficulty level. 

Such information is to support the following functions in this module: 

(a) to display related ci to illustrate the meaning(s) of the character. 

(b) to read aloud the characters and ci in the knowledge-base. 

(c) to read aloud example sentences or display pictures for a particular ci. 

(d) to search for characters that are pronounced identically. 

(e) to search for characters based on partial information on phonemes. 




Tlui %¥ni return characters like: 
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(0 to search for ci according to their meaning categories, and difficult)' levels. 

To start with, the system will contain information for 500 Chinese characters selected from those speciled in 
the school curriculum for young first language leamers (CDC 84, 90). According to our analysis, we envisage 
that within these 500 characters, the structure and basic graphemes necessary for the learning of most Chinese 
characters would have been covered. xVe hope that by learning these 500 characters in the way supported by the 
system, the mental "tool-kit and library" can be developed so that the learner will be able to learn other Chinese 
characters proficiently (Shiu & Lau 1982, Tseng & Wang 1983). 



The Other Levels of The System 



The second level - the student progress database.Tht limit.ation of any teaching/learning material that does 
not take the learner's past experience and competence into account is widely recognized. The objective here is to 
set up a database that records for each student his/her usage profile and achieved competence as demonstrated by 
his/her interactions with the system. Data will be automatically logged into this database when the learner uses 
the various modules at the higher levels. This database can be used to provide feedback to students, to allow 
teachers to monitor the progress of students, and for usp by the software generators at level 4 to' generate 
appropriate CAL software. 

The third level - database tools. This is a set of application software that allows both teachers and students to 
modify, customize and access the Chinese character knowledge-base and the student progress database. 
For the teacher, there are four types of application software: 

1. The knowledge-base construction tool - This is a user-friendly interface provided to allow modification 
and extension of the knowledge-base. Teachers may add in new characters, structures, sense nodes,c/, 
together with voice and image resources, and even new meaning categories. This is seen as an extremely 
important feature. It means that teaching of Chinese characters through the system need not be confined 
by the exact contents of the knowledge-base as provided. Teachers can change it and such changes can 
automatically be reflected in all the applications built on the knowledge-base. 

2. The teacher quen,' tool - This is a tool provided to teachers to query the knowledge-base for lesson 
planning and courseware development purposes. 

3. The teacher customization tool - As with printed dictionaries and thesauruses, different versions will be 
required for different levels of users. The customization tool allows the teacher to index the knowledge- 
base into different levels of difficulty and allow student access to a designated portion only. 

4. The reporting tool - This is for teachers to monitoi the progress of individual students. It reports on the 
profile of usage: modes of usage, parts of the knowledge-base traversed, and performance in cases where 
the student has taken exercise or tests (level 4 and 5 applications) from the system. 

For the student, there are three types of tools: 

1. The student query tool - It may be used as a general reference tool by the student. The basic functions 
have been outlined in the previous section about the knowledge-base. Teachers may design specific 
learning tasks or research projects that require the use of the knowledge-base, or students may be allowed 
to call up this tool for help while they are working on other software on the system. . 

2. The student customization tool - The purpose of this tool is to allow the student to create a personalized 
knowledge base of Chinese characters and ci that he/she wants to retain for easy access in future. 
Furthermore, the student may construct his/her own data-structure for organizing language elements 
(which in itself offers great potential for research into cognitive features in Chinese language learning). 

3. The reporting tool - This is similar to the teacher reporting tool. It allows the student to find out how much 
he/she has learnt and mastered for self-monitoring purposes. 

The fourth level - CAL generation tools. This is a set of software generation tools that can produce courseware 
m specific formats. The teacher just has to choose the courseware type, and specify the required language 
elements to be used, how he/she would like to draw reference from the student progress database, and how the 
student progress database should be updated according to the student's performance. The courseware will be 
automatically generated. One would envisage instruction like the following would be sufficient to generate a 
CAL program: "Generate a program on assembling characters with two components, in the form of a cat-catch- 
mouse game, using the set of characters in the knowledge-base that has been visited by the learner. Add one point 
to the learner's structured-perception index to a character if he/she succeeds in assembling the character." 
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The fifth level - CAL materials. The fifth level is a set of example courseware and self-learning materials, 
both cotiiputer-based and non-computer based, developed using the tools provided at the lower levels. It 
illustrates how the generic tools in level 4 can be use to achieve specific learning objectives. It mcludes learning 
materials that introduce new characters andc/. materials that focus on particular topics about Chinese characters 
or particular language skills, games for enhancing interest or for reinforcement, and tests. While these 
courseware are being used on the computer, the system will be able id update the student progress database at the 
second level of the system. Also, the learner can have the option of moving from the courseware to the student 
query tool for reference or help. 

Conclusion 

This paper reports on the considerations made in the development of a multi-media system that would support 
the teaching and learning of Chinese characters for young first language learners. The logographic and 
morphemic nature of Chinese characters are discussed, and a structural constructive model of leammg is 
described The knowledge-base is currently under construction and the application programs are now m the 
process of detail design. It is envisaged that the evaluation of such a system in classrooms is going to provide 
useful feedback on the various theoretical assumptions about the cognitive processes in Chinese language 
learning discussed in the paper. 
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